The Pennsylvania State University, Spring 2021 Stat 415-001, Hyebin Song

Interval Estimation

Interval EstimationIntroduction to Interval EstimationLearning objectivesInterval estimators and interval estimatesSteps to construct an interval estimator for $\theta$ using a pivotal quantityConfidence intervals for one mean Learning objectivesConfidence intervals for one meanSummaryConfidence Intervals for the difference of two meansLearning objectivesTwo independent random samples vs a paired random sampleConfidence intervals for the difference of two meansSummaryConfidence intervals for proportionsLearning objectivesConfidence interval for one proportionConfidence interval for the difference of proportions

Introduction to Interval Estimation

Learning objectives

Understand the concepts of interval estimators, confidence coefficients, and and interval estimates
$1-\alpha$ confidence intervals.

Interval estimators and interval estimates

Definitions
- $\theta$ random $[l(X_1,\dots,X_n),u(X_1,\dots,X_n)]$ $l(X_1,\dots,X_n)\leq u(X_1,\dots,X_n)$ $\theta$ .
  - $l(\cdot)$ $u(\cdot)$ $[l(X_1,\dots,X_n),u(X_1,\dots,X_n)]$ $\theta$ $100(1-\alpha)$ % times. In other words,
$P_\theta(l(X_1,\dots,X_n)\leq \theta \leq u(X_1,\dots,X_n)) \geq 1-\alpha$
- $1-\alpha$ is called the confidence coefficient .
- $\theta$ observed $[l(x_1,\dots,x_n),u(x_1,\dots,x_n)]$ .
  - $1-\alpha$ $\theta$ $100(1-\alpha)$ % $\theta$ .

Remark
- fixed $\theta$ $1$ $0$ (parameter is not in the interval). If we construct many 95% confidence intervals based on many random samples, then we can expect that 95% of the realized intervals would contain the parameter.
- $\theta$ cannot $\theta$ $(l(x),~ u(x))$ with 95% chance.

$\theta$ using a pivotal quantity

$l(X_1,\dots,X_n)$ $u(X_1,\dots,X_n)$ such that

P_\theta(l(X_1,\dots,X_n) \leq \theta \leq u(X_1,\dots,X_n)) = 1-\alpha,

$0<1-\alpha<1$ .

Basic idea: $\widehat{\theta}$ $\theta$ to decide a "margin" around the point estimate.

$\hat{\theta}$ $\epsilon_n$ $\theta$ $\theta$ $\epsilon_n$ $\hat{\theta}$ .

$\epsilon_n$ $P(|\hat{\theta}-\theta|\leq \epsilon_n) = 1-\alpha$ $P(\hat{\theta} - \epsilon_n \le \theta \le \hat{\theta} +\epsilon_n) = 1-\alpha.$

Example $(X_1,\dots,X_n)$ $X_i \sim N(\mu,2^2)$ .

$\bar{X}$ $\mu$ $\bar{X}$ $N(\mu,\frac{4}{n})$ .

$\bar{X}$ $2\sigma = \frac{4}{\sqrt{n}}$ $\mu$ $=95.45$ %.

$\mu$ $2\sigma = \frac{4}{\sqrt{n}}$ $\bar{X}$ $=95.45$ %.

$\mu$ $(\bar{X}-2\frac{2}{\sqrt{n}},\bar{X} +2 \frac{2}{\sqrt{n}})$

$0.9545$ .

$(x_1,\dots,x_n)$ $(X_1,\dots,X_n)$ $\mu$ $(\bar{x}-2 \frac{2}{\sqrt{n}},\bar{x} + 2\frac{2}{\sqrt{n}})$ .

Here are general steps:

$\widehat{\theta}$ $\theta$ .
- Often, the use of "good" estimators results in good confidence intervals
$\widehat{\theta}$ $\theta$ .
- In particular, we use one of the following three methods:
  - $\widehat{\theta}$
  - $\widehat{\theta}$
  - $\widehat{\theta}$
- $\widehat{\theta}$ $\theta$ $f(\widehat{\theta}, \theta)$ , whose distribution is known.
  - $f(\widehat{\theta},\theta)$ pivotal quantity $f$ $\theta$ ).
$\epsilon_n$ $P(|\hat{\theta}-\theta|\leq \epsilon_n) = 1-\alpha$ $\hat{\theta}$ $f(\hat{\theta},\theta)$ .
$1-\alpha$ $[\hat{\theta}-\epsilon_n, \hat{\theta}+\epsilon_n]$ .

Confidence intervals for one mean

Learning objectives

$1-\alpha$ $\mu$ when we have
1. $(X_1,\dots,X_n)$ $N(\mu,\sigma^2)$ $\sigma^2$
2. $(X_1,\dots,X_n)$ $N(\mu,\sigma^2)$ $\sigma^2$
3. a random sample from an unknown distribution but when we have a large sample size

Confidence intervals for one mean

$\mu$ $X_1, \ldots, X_n\sim N(\mu, \sigma^2)$ $\sigma^2$ known.

Example: $X$ $X$ $N(\mu, 1296)$ $n = 27$ $\bar{x} = 1478$ $\mu$ .

$1-\alpha$ $l(X_1,\dots,X_n)$ $u(X_1,\dots,X_n)$ such that

P_\mu(l(X_1,\dots,X_n) \leq \mu \leq u(X_1,\dots,X_n)) = 1-\alpha.

$[l(x_1,\dots,x_n),u(x_1,\dots,x_n)]$ $1-\alpha$ $\mu$ $l(X_1,\dots,X_n)$ $u(X_1,\dots,X_n)$ ?

We follow the four steps to construct an interval estimator:

$\bar{X} = \frac{1}{n}\sum_{i=1}^n X_i$ $\mu$ .
$X_i \sim N(\mu,\sigma^2)$ $\bar{X} \sim N(\mu, \frac{\sigma^2}{n})$ .
$f(\bar{X},\mu) = \frac{\bar{X}-\mu} {\sigma/\sqrt{n}} \sim N(0,1)$ $\sigma^2$ is known).
$\epsilon_n$ $P(|\bar{X}-\mu|\leq \epsilon_n) = 1-\alpha$ $z_{\alpha/2}$ such that

P(|\frac{\bar{X}-\mu}{\sigma/\sqrt{n}}| \leq z_{\alpha/2} )= P(-z_{\alpha/2}\leq \frac{\bar{X}-\mu}{\sigma/\sqrt{n}} \leq z_{\alpha/2}) = 1-\alpha

$z_{\alpha}$ $\alpha$ $P(Z>z_{\alpha}) = \alpha$ .
- $z_{\alpha}$ in Table V in Appendix B of HTZ.
- $z_\alpha$ $z_{0.025}$ in R, you can run
```
qnorm(0.975,mean = 0,sd = 1) #0.975(=1-0.025) quantile of the standard normal dist
```
  $z_{0.025} = 1.96$ .
$P(|\bar{X}-\mu|\leq z_{\alpha/2} \frac{\sigma}{\sqrt{n}}) = 1-\alpha$ .

$P(|\mu-\bar{X}|\leq z_{\alpha/2} \frac{\sigma}{\sqrt{n}}) = 1-\alpha$ .

$\mu$ $\alpha$ $[\bar{X}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{X} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}]$ .
$1-\alpha$ $\mu$ $[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}]$ .

Therefore, 90% and 95% confidence intervals for the average length of life of a 60-watt light bulb are

$\alpha = 0.1$ $[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}] = [1478 - z_{0.05}\frac{36}{\sqrt{27}},1478 + z_{0.05}\frac{36}{\sqrt{27}}]=[1466.6, 1489.4].$
$\alpha = 0.05$ $[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}] = [1478 - z_{0.025}\frac{36}{\sqrt{27}},1478 + z_{0.025}\frac{36}{\sqrt{27}}]=[1464.4, 1491.6].$

Remark In practice, one can never "know" whether the data are from a normal distribution or not. Often, people perform some sorts of normality tests to determine if an observed data set can be well-modeled by a normal distribution. One quick but powerful test is to plot a histogram of the observed data and see whether the shape of histogram resembles a bell-curve.

$\mu$ $X_1, \ldots, X_n\sim N(\mu, \sigma^2)$ $\sigma^2$ unknown.

Example: $X$ $X$ $n = 27$ $\bar{x} = 1478$ $s = 36$ $\mu$ .

$\mu$ $\sigma^2$ is unknown. Following 4 steps,

$\bar{X}$ $\mu$ .
$X_i \sim N(\mu,\sigma^2)$ $\bar{X} \sim N(\mu, \frac{\sigma^2}{n})$ .
$\frac{\bar{X}-\mu} {\sigma/\sqrt{n}} \sim N(0,1)$ $\bar{X}$ $\mu$ $\sigma^2$ $\bar{X}$ $\mu$ $\sigma^2$ $S^2 = \frac{1}{n-1}\sum_{i=1}^n (X_i - \bar{X})^2$ .
$f(\bar{X},\mu) = \frac{\bar{X}-\mu}{S/\sqrt{n}}$ .

Lemma: $\frac{\bar{X}-\mu} {S/\sqrt{n}} \sim T(n-1)$ $n-1$ .
proof.
$Z \sim N(0,1)$ $U \sim \chi^2(d)$ $Z$ $U$ $\frac{Z}{\sqrt{U/d}} \sim T(d)$ .

$X_1,\dots,X_n \sim N(\mu,\sigma^2)$ i.i.d., we have,
1. $\bar{X}$ $\sum_{i=1}^n (X_i - \bar{X})^2$ are independent.
2. $\sum_{i=1}^n (\frac{X_i - \bar{X}}{\sigma})^2 \sim \chi^2(n-1)$
$(n-1)S^2/\sigma^2 \sim \chi^2(n-1)$ .

Combining two theorems,
$\frac{\bar{X}-\mu} {S/\sqrt{n}} = \frac{(\bar{X}-\mu)/(\sigma/\sqrt{n})} {\sqrt{ \frac{ (n-1)S^2/\sigma^2}{n-1}}} \sim T(n-1)$
$t_{\alpha}(n-1)$ such that
$P(-t_{\alpha}(n-1)\leq \frac{\bar{X}-\mu}{S/\sqrt{n}} \leq t_{\alpha}(n-1)) = 1-\alpha$
- $t_{\alpha}(r)$ $\alpha$ $r$ $P(T>t_{\alpha}(r)) = \alpha$ .
  - $t_{\alpha}(r)$ $\alpha$ values.
  - $t_{\alpha}(r)$ . For example, the script
```
xxxxxxxxxx
qt(0.975,df = 10) #0.975(=1-0.025) quantile of the t dist with df = 10
```
    $t_{0.025}(10) = 2.23$ .
$P(|\mu - \bar{X}|\le t_{\alpha}(n-1) \frac{S}{\sqrt{n}}) = 1-\alpha$ . In other words,

P(\bar{X}-t_{\alpha/2}(n-1)\frac{S}{\sqrt{n}}\leq\mu \leq -t_{\alpha/2}(n-1)\frac{S}{\sqrt{n}}) = 1-\alpha.

$\mu$ $\alpha$ $[\bar{X}-t_{\alpha/2}(n-1)\frac{S}{\sqrt{n}},\bar{X} +t_{\alpha/2}(n-1)\frac{S}{\sqrt{n}}]$ .
$1-\alpha$ $\mu$ $[\bar{x}-t_{\alpha/2}(n-1)\frac{S}{\sqrt{n}},\bar{x} +t_{\alpha/2}(n-1)\frac{S}{\sqrt{n}}]$ .

Therefore, 90% and 95% confidence intervals for the average length of life of a 60-watt light bulb are

$\alpha = 0.1$ ):
$[\bar{x}-t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}},\bar{x} +t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}}] = [1478 - t_{0.05}(26)\frac{36}{\sqrt{27}},1478 + t_{0.05}(26)\frac{36}{\sqrt{27}}]=[1466.2,1489.8].$
- In R, qt(0.95,df=26) returns 1.70
$\alpha = 0.05$ ):
$[\bar{x}-t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}},\bar{x} +t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}}] = [1478 - t_{0.025}(26)\frac{36}{\sqrt{27}},1478 + t_{0.025}(26)\frac{36}{\sqrt{27}}]=[1463.8,1492.2].$
- In R, qt(0.975,df=26) returns 2.01

Remark: $t_{0.05}(26)\geq z_{0.05}$ $t_{0.025}(26)\geq z_{0.025}$ $\sigma^2$ longer $\sigma^2$ $T(26)$ distribution has a thicker probability tail than the standard normal distribution.

$T$ $T$ distribution is quite similar to the standard normal distribution.

$\mu$ $n$

Example: $X$ $n = 100$ $\bar{x} = 1478$ ${\rm Var}(X) = 36$ $\mu$ .

$X_i$ $\bar{X}$ $n=100$ ), and thus we can use the CLT to obtain

\frac{\bar{X}-\mu} {\sigma/\sqrt{n}} \dot\sim N(0,1) \tag{1}

Note: $X_i$ $\bar{X}$ . In such cases, we can use this normal approximation instead.

From (1), we have,

P(-z_{\alpha/2}\leq \frac{\bar{X}-\mu}{\sigma/\sqrt{n}} \leq z_{\alpha/2}) \approx 1-\alpha

Then,

P(|\mu- \bar{X}|\le z_{\alpha/2}\frac{\sigma}{\sqrt{n}}) \approx 1-\alpha.

Thus we have,

$\mu$ $\alpha$ $[\bar{X}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{X} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}]$ .
$1-\alpha$ $\mu$ $[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}]$ .

Remark: $100(1-\alpha)$ $n$ $P_\mu(l(X_1,\dots,X_n) \leq \mu \leq u(X_1,\dots,X_n)) \geq 1-\alpha$ ,

$n$ is large ("asymptotic").

Therefore, approximate 90% and 95% confidence intervals for the average length of life of a 60-watt light bulb are

$\alpha = 0.1$ $[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}] = [1478 - z_{0.05}\frac{36}{\sqrt{100}},1478 + z_{0.05}\frac{36}{\sqrt{100}}]=[1472.1, 1483.9].$
$\alpha = 0.05$ $[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}] = [1478 - z_{0.025}\frac{36}{\sqrt{27}},1478 + z_{0.025}\frac{36}{\sqrt{27}}]=[1470.9, 1485.1].$

$\sigma^2=36$ $\sigma^2$ $S^2$ and use the following approximation:

\frac{\bar{X}-\mu} {S/\sqrt{n}} \dot\sim N(0,1)

$S^2 \approx \sigma^2$ $n$ $S^2$ $\sigma^2$ .)

Then, following similar steps as before, we have,

$\mu$ $\alpha$ $[\bar{X}-z_{\alpha/2}\frac{S}{\sqrt{n}},\bar{X} +z_{\alpha/2}\frac{S}{\sqrt{n}}]$ .
$1-\alpha$ $\mu$ $[\bar{x}-z_{\alpha/2}\frac{s}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{s}{\sqrt{n}}]$ .

In particular, we can use the same approximate 90% and 95% confidence intervals for the average length of life of a 60-watt light bulb even though the population variance is replaced with the sample variance.

Summary

$(x_1,\dots,x_n)$ $(X_1,\dots,X_n)$ ,

Settings	$1-\alpha$ confidence interval
$X_i \sim N(\mu,\sigma^2)$ $\sigma^2$ known	$[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}]$
$X_i \sim N(\mu,\sigma^2)$ $\sigma^2$ unknown	$[\bar{x}-t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}},\bar{x} +t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}}]$
$\mu,\sigma^2<\infty$ $\sigma^2$ $n$	$[\bar{x}-z_{\alpha/2}\frac{\sigma}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{\sigma}{\sqrt{n}}]$ (approximate)
$\mu,\sigma^2<\infty$ $\sigma^2$ $n$	$[\bar{x}-z_{\alpha/2}\frac{s}{\sqrt{n}},\bar{x} +z_{\alpha/2}\frac{s}{\sqrt{n}}]$ (approximate)

Remark: $\bar{X}$ $X_i$ symmetric, unimodal, and of the continuous type $X_i$ is highly skewed, a larger sample size is needed for the approximation to be reasonably accurate.

$X_i$ $\bar{X}$ .

Confidence Intervals for the difference of two means

Learning objectives

Understand the difference of independent and paired random samples
$1-\alpha$ confidence interval for the difference of two population means when we have
1. $(X_1,\dots,X_{n_X})$ $(Y_1,\dots,Y_{n_Y})$ $N(\mu_X,\sigma_X^2)$ $N(\mu_Y,\sigma_Y^2)$ $\sigma_X^2$ $\sigma_Y^2$ .
2. $(X_1,\dots,X_{n_X})$ $(Y_1,\dots,Y_{n_Y})$ $N(\mu_X,\sigma^2)$ $N(\mu_Y,\sigma^2)$ $\sigma^2=\sigma_X^2=\sigma_Y^2$ $\sigma_X^2 \neq \sigma_Y^2$ )
3. $(X_1,\dots,X_{n_X})$ $(Y_1,\dots,Y_{n_Y})$ $n_X$ $n_Y$ .
4. paired $(X_1,Y_1),\dots,(X_n,Y_n)$

Two independent random samples vs a paired random sample

Suppose a researcher wants to study whether lack of sleep impacts cognitive performance.

Protocol 1: $X_i$ $i$ $Y_i$ $i$ th participant from the second group.

Protocol 2: $X_i$ $i$ $Y_i$ $i$ th participant .

$i$ $X_i$ $Y_i$ $X_i$ $Y_i$ $i$ paired $X_i$ $Y_i$ should be paired. Therefore,

$(X_1,\dots,X_{10})$ $(Y_1,\dots,Y_{10})$ .
$((X_1,Y_1),\dots,(X_{10},Y_{10}))$ .

Confidence intervals for the difference of two means

1. Confidence interval for the difference of the population mean from two independence normal random samples with known variances

Example The researcher followed the first protocol and obtained the following data:

Group	Scores
Group 1 (normal sleep)	8.4, 9.2, 8.2, 10.6, 9.3, 8.2, 9.5, 9.7, 9.6, 8.7
Group 2 (awake for 24 hours)	8.5, 7.4, 6.4, 4.8, 8.1, 7.0, 7.0, 7.9, 7.8, 7.6

$X$ $Y$ $1$ . The researcher wants to form a 95% confidence interval for the difference of test scores between two groups.

$\mu_X$ $\mu_Y$ $1-\alpha$ $l(X_1,\dots,X_n)$ $u(X_1,\dots,X_n)$ such that

P_\mu(l(X_1,\dots,X_{n_X},Y_1,\dots,Y_{n_Y}) \leq \mu_X-\mu_Y \leq u(X_1,\dots,X_{n_X}, Y_1,\dots,Y_{n_Y})) = 1-\alpha.

We follow the four steps to find an interval estimator:

$\bar{X}-\bar{Y}$ $\mu_X-\mu_Y$ .
$\bar{X}-\bar{Y}$ $N(\mu_X - \mu_Y, \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y})$ $\bar{X}$ $\bar{Y}$ )
The pivotal quantity:
$\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}} \sim N(0,1)$
$z_{\alpha/2}$ such that
$P(-z_{\alpha/2}\leq \frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}} \leq z_{\alpha/2}) = 1-\alpha$
$P(|(\bar{X}-\bar{Y})-(\mu_X - \mu_Y)|\le z_{\alpha/2} \sqrt{\frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}) = 1-\alpha$ .
Rearranging the terms in the event,
$P( (\bar{X}-\bar{Y})-z_{\alpha/2}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}} \le \mu_X - \mu_Y\le (\bar{X}-\bar{Y})+z_{\alpha/2}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}}) = 1-\alpha$

Therefore,

$\mu_X-\mu_Y$ $\alpha$ $[\bar{X}-\bar{Y}-z_{\alpha/2}\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}},\bar{X}-\bar{Y} +z_{\alpha/2}\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}]$ .
$1-\alpha$ $\mu_X-\mu_Y$ :
$[\bar{x}-\bar{y}-z_{\alpha/2}\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}},\bar{x}-\bar{y} +z_{\alpha/2}\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}]$

A 95% confidence interval for the difference of test scores between group 1 and group 2 is

$[\bar{x}-\bar{y}-z_{0.025}\sqrt{ \frac{1}{10}+\frac{1}{10}},\bar{x}-\bar{y} +z_{0.025}\sqrt{ \frac{1}{10}+\frac{1}{10}}]= [1.013, 2.767]$

In R,


xxxxxxxxxx
group1 = c(8.4, 9.2, 8.2, 10.6, 9.3, 8.2, 9.5, 9.7, 9.6, 8.7)
group2 = c(8.5, 7.4, 6.4, 4.8, 8.1, 7, 7, 7.9, 7.8, 7.6)
xbar = mean(group1) # compute the sample mean of group 1 data
ybar = mean(group2) # compute the sample mean of group 2 data
margin = qnorm(1-0.025)*sqrt(1/10 + 1/10) 
CI_for_diff = c(xbar-ybar - margin, xbar-ybar + margin) # 95% CI
CI_for_diff
[1] 1.013477 2.766523

2. Confidence interval for the difference of the population mean from two independence normal random samples with unknown variances

Example: $1$ . The prior studies indicate that the variances of the scores from each group are likely to be the same. The researcher wants to form a 95% confidence interval for the difference of test scores between two groups.

$\sigma^2 = \sigma_X^2 = \sigma_Y^2$

In Step 3, we used

\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}} \sim N(0,1)

$\sigma^2 = \sigma_X^2 = \sigma_Y^2$ $\mu_X-\mu_Y$ $\sigma^2$ with a pooled estimator of the common variance

S_p^2 = \frac{\sum_{i=1}^{n_X} (X_i-\bar{X})^2 + \sum_{i=1}^{n_Y} (Y_i-\bar{Y})^2}{n_X-1+n_Y-1}.

Lemma:

\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{S_p^2}{n_X}+\frac{S_p^2}{n_Y}}} \sim t(n_X+n_Y-2).

proof.
$Z = \frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}} \sim N(0,1)$ $U$ such that
$U = \frac{(n_X+n_Y-2)S_p^2}{\sigma^2} =\sum_{i=1}^{n_X} \frac{(x_i-\bar{x})^2}{\sigma^2} + \sum_{i=1}^{n_Y} \frac{(y_i-\bar{y})^2}{\sigma^2} \sim \chi^2(n_X-1+n_Y-1)$
By theorem 5.5-3 in HTZ,
$\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{S_p^2}{n_X}+\frac{S_p^2}{n_Y}}} = \frac{Z}{\sqrt{U/(n_X+n_Y-2)}}\sim t(n_X+n_Y-2).$

Following similar steps,

$\mu_X-\mu_Y$ $1-\alpha$ $[\bar{X}-\bar{Y}-t_{\alpha/2}(n_X+n_Y-2)\sqrt{ \frac{S_p^2}{n_X}+\frac{S_p^2}{n_Y}},\bar{X}-\bar{Y} +t_{\alpha/2}(n_X+n_Y-2)\sqrt{ \frac{S_p^2}{n_X}+\frac{S_p^2}{n_Y}}]$ .
$1-\alpha$ $\mu_X-\mu_Y$ :
$[\bar{x}-\bar{y}-t_{\alpha/2}(n_X+n_Y-2)\sqrt{ \frac{s_p^2}{n_X}+\frac{s_p^2}{n_Y}},\bar{x}-\bar{y} +t_{\alpha/2}(n_X+n_Y-2)\sqrt{ \frac{s_p^2}{n_X}+\frac{s_p^2}{n_Y}}]$

Therefore, a 95% confidence interval for the difference of test scores between group 1 and group 2 is

$[\bar{x}-\bar{y}-t_{0.025}(18)\sqrt{ \frac{s_p^2}{10}+\frac{s_p^2}{10}},\bar{x}-\bar{y} +t_{0.025}(18)\sqrt{ \frac{s_p^2}{10}+\frac{s_p^2}{10}}]= [1.023, 2.757]$

In R,


xxxxxxxxxx
group1 = c(8.4, 9.2, 8.2, 10.6, 9.3, 8.2, 9.5, 9.7, 9.6, 8.7)
group2 = c(8.5, 7.4, 6.4, 4.8, 8.1, 7, 7, 7.9, 7.8, 7.6)
xbar = mean(group1) # compute the sample mean of group 1 data
ybar = mean(group2) # compute the sample mean of group 2 data
sp2 = (sum((group1-xbar)^2) + sum((group2-ybar)^2))/18 # pooled sample variance
margin = qt(1-0.025,df = 18)*sqrt(sp2/10 + sp2/10) 
CI_for_diff = c(xbar-ybar - margin, xbar-ybar + margin) # 95% CI
CI_for_diff
[1] 1.022947 2.757053

$\sigma_X^2 \ne \sigma_Y^2$

$\sigma_X^2$ $\sigma_Y^2$ $S_X^2$ $S_Y^2$ in (2) and use

W=\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{S_X^2}{n_X}+\frac{S_Y^2}{n_Y}}}

as a pivotal quantity.

$W$ $[r]$ degrees of freedom (Welch, 1949) where

r = \frac{(S_X^2/n_X+S_Y^2/n_Y)^2}{(n_X-1)^{-1}(S_X^2/n_X)^2+(n_Y-1)^{-1}(S_Y^2/n_Y)^2}.

Using Welch's approximation,

$\mu_X-\mu_Y$ $1-\alpha$ $[\bar{X}-\bar{Y}-t_{\alpha/2}(r)\sqrt{ \frac{S_X^2}{n_X}+\frac{S_Y^2}{n_Y}},\bar{X}-\bar{Y} +t_{\alpha/2}(r)\sqrt{ \frac{S_X^2}{n_X}+\frac{S_Y^2}{n_Y}}]$ .
$1-\alpha$ $\mu_X-\mu_Y$ :
$[\bar{x}-\bar{y}-t_{\alpha/2}(r)\sqrt{ \frac{s_X^2}{n_X}+\frac{s_Y^2}{n_Y}},\bar{x}-\bar{y} +t_{\alpha/2}(r)\sqrt{ \frac{s_X^2}{n_X}+\frac{s_Y^2}{n_Y}}]$

3. Confidence interval for the difference of the population mean from two independence random samples with unknown distributions

Example The researcher decided to follow the first protocol to study the effect of sleep deprivation. The researcher first looked at some of previous literature and found out that some of the scores from previous studies had a skewed distribution. Worried about possible non-normality of test scores, the researcher recruited 100 participants (50 participants per group) and carried out experiments. Plotting histograms of test scores from each group, the researcher concluded that test scores are not likely to be from normal distributions. The researcher wants to form a 95% confidence interval for the difference of test scores between two groups.

Group	Sample mean	Sample variance
Group 1	9.114	0.811
Group 2	6.963	0.918

$(X_1,\dots,X_{n_X})$ $(Y_1,\dots,Y_{n_Y})$ $n_X$ $n_Y$ .

From an application of a version of CLT, we have

\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}} \dot\sim N(0,1)\tag{2}

and

\frac{\bar{X}-\bar{Y}-(\mu_X - \mu_Y)}{\sqrt{ \frac{S_X^2}{n_X}+\frac{S_Y^2}{n_Y}}} \dot\sim N(0,1)\tag{3}

$S_X^2 \approx \sigma_X^2$ $S_Y^2 \approx \sigma_Y^2$ $n_X$ $n_Y$ are sufficiently large.

Using (2) and (3),

$1-\alpha$ $\mu_X-\mu_Y$ with known variances:
$[\bar{x}-\bar{y}-z_{\alpha/2}\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}},\bar{x}-\bar{y} +z_{\alpha/2}\sqrt{ \frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}]$
$1-\alpha$ $\mu_X-\mu_Y$ with unknown variances:
$[\bar{x}-\bar{y}-z_{\alpha/2}\sqrt{ \frac{s_X^2}{n_X}+\frac{s_Y^2}{n_Y}},\bar{x}-\bar{y} +z_{\alpha/2}\sqrt{ \frac{s_X^2}{n_X}+\frac{s_Y^2}{n_Y}}]$

Therefore, an approximate 95% confidence interval for the difference of test scores between group 1 and group 2 is

$[9.114-6.963-z_{0.025}\sqrt{ \frac{0.811}{50}+\frac{0.918}{50}},9.114-6.963+z_{0.025}\sqrt{ \frac{0.811}{50}+\frac{0.918}{50}}]$

4. Confidence interval for the difference of the population mean from a paired random samples

Example A researcher wants to study whether lack of sleep impacts cognitive performance. The researcher recruited 10 participants. Each participant is asked to take the tests twice: one after a normal sleep and the other after being kept awake for 24 hours.

	1	2	3	4	5	6	7	8	9	10
First test (normal sleep)	8.1	9.5	7.2	11.6	9.9	7.3	10	10.7	10.4	8.5
Second test (awake for 24 hours)	7.0	8.6	6.3	10.7	8.8	6.3	8.9	9.1	9.0	7.5

Suppose it is reasonable to assume that the difference of test scores is normally distributed.

$X_i$ $Y_i$ $\bar{X}$ $\bar{Y}$ are dependent (Observe that a participant who scored high in the first test tended to score high in the second test).

$D_i = X_i-Y_i$ $X_i$ $Y_i$ $E[D_i] = \mu_D = \mu_X-\mu_Y$ $1-\alpha$ $\mu_D$ $D_1,\dots,D_n$ $D_i \sim N(\mu_D,\sigma_D^2)$ $\sigma_D^2$ is unknown, we can use a confidence interval based on the t distribution.

Therefore, the 95% confidence interval for the difference is

$[\bar{d}-t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}},\bar{d} +t_{\alpha/2}(n-1)\frac{s}{\sqrt{n}}]=[\bar{d}-t_{0.025}(9)\frac{s}{\sqrt{10}},\bar{d} +t_{0.025}(9)\frac{s}{\sqrt{10}}] =[0.935, 1.265]$

$\bar{d}$ $s$ are sample mean and variance of differences of test scores.

In R,


xxxxxxxxxx
first = c(8.1, 9.5, 7.2, 11.6, 9.9, 7.3, 10, 10.7, 10.4, 8.5)
second = c(7, 8.6, 6.3, 10.7, 8.8, 6.3, 8.9, 9.1, 9, 7.5)
diff = first - second # compute the difference of scores
dmean = mean(diff)# sample mean
dvar  = var(diff) # sample variance
margin = qt(1-0.025,df = 9)*sqrt(dvar/10) 
CI_for_diff = c(dmean - margin, dmean + margin) # 95% CI
CI_for_diff
[1] 0.9347954 1.2652046

Summary

$(x_1,\dots,x_{n_X})$ $(y_1,\dots,y_{n_Y})$ $(X_1,\dots,X_{n_X})$ $(Y_1,\dots,Y_{n_Y})$

Settings	$1-\alpha$ confidence interval
$X_i \sim N(\mu_X,\sigma_X^2)$ $Y_i\sim N(\mu_Y, \sigma_Y^2)$ $\sigma_X^2, \sigma_Y^2$ known	$(\bar{x}-\bar{y})\pm z_{\alpha/2}\sqrt{\frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}$
$X_i \sim N(\mu_X,\sigma_X^2)$ $Y_i\sim N(\mu_Y, \sigma_Y^2)$ $\sigma^2 = \sigma_X^2 = \sigma_Y^2$ unknown	$(\bar{x}-\bar{y})\pm t_{\alpha/2}(n_X+n_Y-2)\sqrt{\frac{s_p^2}{n_X}+\frac{s_p^2}{n_Y}}$
$X_i \sim N(\mu_X,\sigma_X^2)$ $Y_i\sim N(\mu_Y, \sigma_Y^2)$ $\sigma_X^2 \ne \sigma_Y^2$ unknown	$(\bar{x}-\bar{y})\pm t_{\alpha/2}(r)\sqrt{\frac{s_X^2}{n_X}+\frac{s_Y^2}{n_Y}}$ $r$ s the df from Welch's approximation
$n_X$ $n_Y$	$(\bar{x}-\bar{y})\pm z_{\alpha/2}\sqrt{\frac{\sigma_X^2}{n_X}+\frac{\sigma_Y^2}{n_Y}}$ $(\bar{x}-\bar{y})\pm z_{\alpha/2}\sqrt{\frac{s_X^2}{n_X}+\frac{s_Y^2}{n_Y}}$
$n = n_X=n_Y$ $X_i - Y_i \sim N(\mu_X-\mu_Y,\sigma_D^2)$ $\sigma_D^2$ unknown	$(\bar{x}-\bar{y})\pm t_{\alpha/2}(n-1)\sqrt{\frac{s_D^2}{n}}$
$n = n_X=n_Y$ $n$	$(\bar{x}-\bar{y})\pm z_{\alpha/2}\sqrt{\frac{s_D^2}{n}}$

where

$s_p^2$ is a pooled sample variance estimate,
$s_X^2$ $(X_1,\dots,X_{n_X})$ ,
$s_Y^2$ $(Y_1,\dots,Y_{n_Y})$ ,
$s_D^2$ $(D_1,\dots,D_n)$ $D_i = X_i - Y_i$ .

Confidence intervals for proportions

Learning objectives

$1-\alpha$ confidence interval for the population proportion
$1-\alpha$ confidence interval for the difference of population proportions

Confidence interval for one proportion

$X_1,\dots,X_n \sim {\rm Ber}(p)$ $1-\alpha$ $p$ $n$ $p = E[X_i]$ $i=1,\dots,n$ ). By CLT, we have,

\frac{\bar{X}-p}{\sqrt{p(1-p)/n}} \dot\sim N(0,1).

$\frac{\bar{X}-p}{\sqrt{p(1-p)/n}}$ $\bar{X}$ $p$ (and does not depend on any unknown parameter).

$c=z_{\alpha/2}$ such that

P(-c \le \frac{\bar{X}-p}{\sqrt{p(1-p)/n}}\le c) \approx 1-\alpha

$p$ .

1. Wald Confidence Intervals:

$\bar{X} \approx p$ $n$ $\bar{X}$ $p$ ),

\frac{\bar{X}-p}{\sqrt{\bar{X}(1-\bar{X})/n}} \dot\sim N(0,1).

Therefore, we have,

P(-z_{\alpha/2} \le \frac{\bar{X}-p}{\sqrt{\bar{X}(1-\bar{X})/n}}\le z_{\alpha/2}) \approx 1-\alpha

In other words,

P(\bar{X}-z_{\alpha/2}\sqrt{\bar{X}(1-\bar{X})/n} \le p \le \bar{X}+z_{\alpha/2}\sqrt{\bar{X}(1-\bar{X})/n}) \approx 1-\alpha

$p$ $1-\alpha$ $[\bar{X}-z_{\alpha/2}\sqrt{\frac{\bar{X}(1-\bar{X})}{n}},\bar{X} +z_{\alpha/2}\sqrt{\frac{\bar{X}(1-\bar{X})}{n}]}$ .
$1-\alpha$ $p$ $[\bar{x}-z_{\alpha/2}\sqrt{\frac{\bar{x}(1-\bar{x})}{n}},\bar{x} +z_{\alpha/2}\sqrt{\frac{\bar{x}(1-\bar{x})}{n}]}$ .

2. Wilson Confidence Intervals:

P(-z_{\alpha/2} \le \frac{\bar{X}-p}{\sqrt{p(1-p)/n}}\le z_{\alpha/2}) = P(|\frac{\bar{X}-p}{\sqrt{p(1-p)/n}}|\le z_{\alpha/2})=P(|\frac{\bar{X}-p}{\sqrt{p(1-p)/n}}|^2\le z_{\alpha/2}^2)

Rewriting, we get

$n\frac{(\bar{X}-p)^2}{p(1-p)}\leq z_{\alpha/2}^2$ , and

$(\bar{X}-p)^2-z_{\alpha/2}^2p(1-p)/n\leq 0.$

In other words,

P( (1+\frac{z_{\alpha/2}^2}{n})p^2 - (2\bar{X} +\frac{z_{\alpha/2}^2}{n})p +\bar{X}^2 \le 0 ) \approx 1-\alpha

By the quadratic formula, we can show that

P(\frac{\bar{X}+ \ z_{\alpha/2}^2/(2n)- z_{\alpha/2}\sqrt{\bar{X}(1-\bar{X})/n+z_{\alpha/2}^2/(4n^2)}}{1+z_{\alpha/2}^2/n} \le p \le \frac{\bar{X}+ \ z_{\alpha/2}^2/(2n)+ z_{\alpha/2}\sqrt{\bar{X}(1-\bar{X})/n+z_{\alpha/2}^2/(4n^2)}}{1+z_{\alpha/2}^2/n}) \approx 1-\alpha

$p$ $1-\alpha$ :
$[\frac{\bar{X}+ \ z_{\alpha/2}^2/(2n)- z_{\alpha/2}\sqrt{\bar{X}(1-\bar{X})/n+z_{\alpha/2}^2/(4n^2)}}{1+z_{\alpha/2}^2/n} , \frac{\bar{X}+ \ z_{\alpha/2}^2/(2n)+ z_{\alpha/2}\sqrt{\bar{X}(1-\bar{X})/n+z_{\alpha/2}^2/(4n^2)}}{1+z_{\alpha/2}^2/n} ]$
$1-\alpha$ $p$ :
$[\frac{\bar{x}+ \ z_{\alpha/2}^2/(2n)- z_{\alpha/2}\sqrt{\bar{x}(1-\bar{x})/n+z_{\alpha/2}^2/(4n^2)}}{1+z_{\alpha/2}^2/n} , \frac{\bar{x}+ \ z_{\alpha/2}^2/(2n)+ z_{\alpha/2}\sqrt{\bar{x}(1-\bar{x})/n+z_{\alpha/2}^2/(4n^2)}}{1+z_{\alpha/2}^2/n} ]$

Remarks:

asymptotically $n$ $(1-\alpha)$
- $n$ $\ z_{\alpha/2}^2/(2n)$ $z_{\alpha/2}^2/(4n^2)$ $z_{\alpha/2}^2/n$ are small, thus Wilson and Wald confidence intervals are approximately equal.
$n$ $p$ near 0 or 1, the Wald confidence interval performs badly
$[0,1]$ $[0,1]$ (which doesn't make sense for proportions)
The Wilson confidence interval often has better coverage probability than the Wald interval
For these reasons, the Wilson confidence interval is sometimes recommended over the Wald confidence interval

Confidence interval for the difference of proportions

independent $X_1,\dots,X_{n_X} \sim {\rm Ber}(p_1)$ $Y_1,\dots,Y_{n_Y}\sim {\rm Ber}(p_2)$ $1-\alpha$ $p_1-p_2$ $n_X, n_Y$ are large.

From an application of a version of CLT, we have

\frac{\bar{X}-\bar{Y}-(p_1 - p_2)}{\sqrt{ \frac{{\rm Var}(X)}{n_X}+\frac{{\rm Var}(Y)}{n_Y}}} \dot\sim N(0,1)

and

\frac{\bar{X}-\bar{Y}-(p_1 - p_2)}{\sqrt{ \frac{\bar{X}(1-\bar{X})}{n_X}+\frac{\bar{Y}(1-\bar{Y})}{n_Y}}} \dot\sim N(0,1).

Therefore,

$p_1-p_2$ $1-\alpha$ $[\bar{X}-\bar{Y}-z_{\alpha/2}\sqrt{ \frac{\bar{X}(1-\bar{X})}{n_X}+\frac{\bar{Y}(1-\bar{Y})}{n_Y}},\bar{X}-\bar{Y} +z_{\alpha/2}\sqrt{ \frac{\bar{X}(1-\bar{X})}{n_X}+\frac{\bar{Y}(1-\bar{Y})}{n_Y}}]$ .
$1-\alpha$ $p_1-p_2$ :
$[\bar{x}-\bar{y}-z_{\alpha/2}\sqrt{ \frac{\bar{x}(1-\bar{x})}{n_X}+\frac{\bar{y}(1-\bar{y})}{n_Y}},\bar{x}-\bar{y} +z_{\alpha/2}\sqrt{ \frac{\bar{x}(1-\bar{x})}{n_X}+\frac{\bar{y}(1-\bar{y})}{n_Y}}]$ .

Example $100$ $25$ $100$ $30$ $p_M$ $p_W$ $p_M-p_W$ .

$X_i$ $Y_i$ be the preference of the ith women.
$X_i \sim {\rm Ber}(p_M)$ $Y_i \sim {\rm Ber}(p_W)$ $\bar{x} = .25, \bar{y} = .3$ .
$p_M- p_W$ is
$\begin{align} &[\bar{x}-\bar{y}-z_{\alpha/2}\sqrt{ \frac{\bar{x}(1-\bar{x})}{n_X}+\frac{\bar{y}(1-\bar{y})}{n_Y}},\bar{x}-\bar{y} +z_{\alpha/2}\sqrt{ \frac{\bar{x}(1-\bar{x})}{n_X}+\frac{\bar{y}(1-\bar{y})}{n_Y}}]\\ &=[(-0.05)-1.96\sqrt{ \frac{.25(.75)}{100}+\frac{.3(.7)}{100}},(-0.05)+1.96\sqrt{ \frac{.25(.75)}{100}+\frac{.3(.7)}{100}}]\\ &=[-0.174, 0.074] \end{align}$

Interval Estimation

Introduction to Interval Estimation

Learning objectives

Interval estimators and interval estimates

Steps to construct an interval estimator for \theta using a pivotal quantity

Confidence intervals for one mean

Learning objectives

Confidence intervals for one mean

Summary

Confidence Intervals for the difference of two means

Learning objectives

Two independent random samples vs a paired random sample

Confidence intervals for the difference of two means

Summary

Confidence intervals for proportions

Learning objectives

Confidence interval for one proportion

Confidence interval for the difference of proportions

$\theta$ using a pivotal quantity